eye-tracking data
AI study gives insights into why super-recognisers excel at identifying faces
Research has suggested super-recognisers look at more areas across a face than typical people.

Using eye-tracking data, the study examines some people's extraordinary recognition ability. Super-recognisers have been used in the search for the Salisbury novichok poisoners, in finding murder suspects and even in spotting sexual predators. Now, research has revealed fresh insights into why they are so good at identifying faces: previous research had suggested that people with this extraordinary ability look at more areas across a face than typical people.
- North America > United States (0.19)
- Europe > Ukraine (0.07)
- Oceania > Australia (0.05)
- Europe > United Kingdom > England > Dorset > Bournemouth (0.05)
- Government > Regional Government (0.96)
- Leisure & Entertainment > Sports (0.74)
- Health & Medicine (0.71)
Window-Based Feature Engineering for Cognitive Workload Detection
Hallam, Andrew, Gayathri, R G, Lee, Glory, Sajjanhar, Atul
Cognitive workload is a topic of increasing interest across various fields such as health, psychology, and defense. In this research, we focus on classifying cognitive workload using the COLET dataset, employing a window-based approach for feature generation and machine/deep learning techniques for classification. We apply window-based temporal partitioning to enhance features used in existing research, followed by machine learning and deep learning models to classify different levels of cognitive workload. The results demonstrate that deep learning models, particularly tabular architectures, outperformed traditional machine learning methods in precision, F1-score, and accuracy. This study highlights the effectiveness of window-based temporal feature extraction and the potential of deep learning techniques for real-time cognitive workload assessment in complex and dynamic tasks.
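A minimal sketch of the window-based feature generation step (window length, step size, and the statistics computed are illustrative assumptions, not the authors' implementation):

```python
import numpy as np

def window_features(signal, win_len=250, step=125):
    """Slide a fixed-length window over a 1-D eye-tracking signal
    (e.g., pupil diameter samples) and compute summary statistics
    per window. win_len and step are illustrative sample counts."""
    feats = []
    for start in range(0, len(signal) - win_len + 1, step):
        w = signal[start:start + win_len]
        feats.append([w.mean(), w.std(), w.min(), w.max(),
                      np.median(w), np.abs(np.diff(w)).mean()])
    return np.asarray(feats)  # shape: (n_windows, 6)

# Example: 10 s of a 250 Hz pupil-diameter trace
rng = np.random.default_rng(0)
pupil = 3.5 + 0.2 * rng.standard_normal(2500)
X = window_features(pupil)  # one feature row per 1 s window, 0.5 s overlap
```

Each window becomes one tabular feature row, which is what allows tabular deep learning architectures and classical classifiers to be compared on equal footing.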
Cross-Enhanced Multimodal Fusion of Eye-Tracking and Facial Features for Alzheimer's Disease Diagnosis
Nie, Yujie, Ni, Jianzhang, Ye, Yonglong, Zhang, Yuan-Ting, Wing, Yun Kwok, Xu, Xiangqing, Ma, Xin, Fan, Lizhou
Accurate diagnosis of Alzheimer's disease (AD) is essential for enabling timely intervention and slowing disease progression. Multimodal diagnostic approaches offer considerable promise by integrating complementary information across behavioral and perceptual domains. Eye-tracking and facial features, in particular, are important indicators of cognitive function, reflecting attentional distribution and neurocognitive state. However, few studies have explored their joint integration for auxiliary AD diagnosis. In this study, we propose a multimodal cross-enhanced fusion framework that synergistically leverages eye-tracking and facial features for AD detection. The framework incorporates two key modules: (a) a Cross-Enhanced Fusion Attention Module (CEFAM), which models inter-modal interactions through cross-attention and global enhancement, and (b) a Direction-Aware Convolution Module (DACM), which captures fine-grained directional facial features via horizontal-vertical receptive fields. To support this work, we constructed a synchronized multimodal dataset, including 25 patients with AD and 25 healthy controls (HC), by recording aligned facial video and eye-tracking sequences during a visual memory-search paradigm, providing an ecologically valid resource for evaluating integration strategies. Extensive experiments on this dataset demonstrate that our framework outperforms traditional late fusion and feature concatenation methods, achieving a classification accuracy of 95.11% in distinguishing AD from HC, highlighting superior robustness and diagnostic performance by explicitly modeling inter-modal dependencies and modality-specific contributions.

Introduction

Alzheimer's disease (AD), a progressive and irreversible neurodegenerative disorder, represents the primary cause of dementia in older adults [1]. It typically begins with mild memory loss and gradually progresses to severe impairments in executive and cognitive functions [2]. As the global population ages, more than 150 million people worldwide will be affected by AD or other forms of dementia [3], imposing a substantial burden on both families and healthcare systems. Early and accurate identification of Alzheimer's disease is vital to initiate interventions that may slow progression and improve quality of life. Clinically, the diagnosis of AD primarily relies on biomarker analysis, neuroimaging techniques, and neuropsychological assessments.
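A hedged PyTorch sketch of the cross-attention fusion idea behind CEFAM (the symmetric attention layout, dimensions, and mean-pooling are assumptions; the paper's global-enhancement step and the DACM are not shown):

```python
import torch
import torch.nn as nn

class CrossAttentionFusion(nn.Module):
    """Each modality attends to the other, then the enhanced
    streams are pooled and concatenated for classification."""
    def __init__(self, dim=128, heads=4):
        super().__init__()
        self.eye_to_face = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.face_to_eye = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.head = nn.Linear(2 * dim, 2)  # AD vs. healthy control logits

    def forward(self, eye, face):                    # (B, T, dim) each
        eye_enh, _ = self.eye_to_face(eye, face, face)   # eye queries face
        face_enh, _ = self.face_to_eye(face, eye, eye)   # face queries eye
        fused = torch.cat([eye_enh.mean(1), face_enh.mean(1)], dim=-1)
        return self.head(fused)                      # (B, 2)

logits = CrossAttentionFusion()(torch.randn(8, 50, 128), torch.randn(8, 50, 128))
```

The point of the cross-attention layout is that inter-modal dependencies are modeled explicitly rather than left implicit in a late concatenation, which is what the paper credits for the gain over late fusion.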
- Asia > China > Hong Kong (0.04)
- North America > Canada > Quebec > Montreal (0.04)
- Europe > Sweden (0.04)
- (2 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
Modelling the Interplay of Eye-Tracking Temporal Dynamics and Personality for Emotion Detection in Face-to-Face Settings
Seikavandi, Meisam J., Fimland, Jostein, Narcizo, Fabricio Batista, Barrett, Maria, Vucurevich, Ted, Boldt, Jesper Bünsow, Dittberner, Andrew Burke, Burelli, Paolo
Accurate recognition of human emotions is critical for adaptive human-computer interaction, yet remains challenging in dynamic, conversation-like settings. This work presents a personality-aware multimodal framework that integrates eye-tracking sequences, Big Five personality traits, and contextual stimulus cues to predict both perceived and felt emotions. Seventy-three participants viewed speech-containing clips from the CREMA-D dataset while providing eye-tracking signals, personality assessments, and emotion ratings. Our neural models captured temporal gaze dynamics and fused them with trait and stimulus information, yielding consistent gains over SVM and literature baselines. Results show that (i) stimulus cues strongly enhance perceived-emotion predictions (macro F1 up to 0.77), while (ii) personality traits provide the largest improvements for felt emotion recognition (macro F1 up to 0.58). These findings highlight the benefit of combining physiological, trait-level, and contextual information to address the inherent subjectivity of emotion. By distinguishing between perceived and felt responses, our approach advances multimodal affective computing and points toward more personalized and ecologically valid emotion-aware systems.
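A speculative PyTorch sketch of how such a personality-aware fusion might look (the GRU summariser, layer sizes, and the embedding of the six CREMA-D emotion labels are assumptions, not the authors' architecture):

```python
import torch
import torch.nn as nn

class PersonalityAwareEmotionNet(nn.Module):
    """Hypothetical fusion: a GRU summarises the gaze sequence, then
    Big Five trait scores and a stimulus-emotion embedding are
    concatenated before the classifier."""
    def __init__(self, gaze_dim=4, hidden=64, n_classes=3):
        super().__init__()
        self.gru = nn.GRU(gaze_dim, hidden, batch_first=True)
        self.stim_emb = nn.Embedding(6, 8)      # 6 CREMA-D emotion labels
        self.clf = nn.Linear(hidden + 5 + 8, n_classes)

    def forward(self, gaze, traits, stim):      # (B,T,4), (B,5), (B,)
        _, h = self.gru(gaze)
        z = torch.cat([h[-1], traits, self.stim_emb(stim)], dim=-1)
        return self.clf(z)

net = PersonalityAwareEmotionNet()
out = net(torch.randn(8, 100, 4), torch.randn(8, 5), torch.randint(0, 6, (8,)))
```

Concatenating trait and stimulus vectors after the temporal encoder mirrors the reported result pattern: stimulus cues help perceived-emotion prediction, while traits help felt-emotion prediction.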
- Europe > Denmark > Capital Region > Copenhagen (0.05)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- Information Technology > Human Computer Interaction (1.00)
- Information Technology > Artificial Intelligence > Vision (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Cognitive Science > Emotion (1.00)
Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning
Guo, Dongyang, Abdrabou, Yasmeen, Thaqi, Enkeleda, Kasneci, Enkelejda
Eye-tracking data reveals valuable insights into users' cognitive states but is difficult to analyze due to its structured, non-linguistic nature. While large language models (LLMs) excel at reasoning over text, they struggle with temporal and numerical data. This paper presents a multimodal human-AI collaborative framework designed to enhance cognitive pattern extraction from eye-tracking signals. The framework includes: (1) a multi-stage pipeline using horizontal and vertical segmentation alongside LLM reasoning to uncover latent gaze patterns; (2) an Expert-Model Co-Scoring Module that integrates expert judgment with LLM output to generate trust scores for behavioral interpretations; and (3) a hybrid anomaly detection module combining LSTM-based temporal modeling with LLM-driven semantic analysis. Our results across several LLMs and prompt strategies show improvements in consistency, interpretability, and performance, with up to 50% accuracy in difficulty prediction tasks. This approach offers a scalable, interpretable solution for cognitive modeling and has broad potential in adaptive learning, human-computer interaction, and educational analytics.
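A toy illustration of the co-scoring idea (the weighting scheme and score range are assumptions; the paper's Expert-Model Co-Scoring Module is more elaborate):

```python
def trust_score(expert_score, llm_scores, w_expert=0.6):
    """Hypothetical co-scoring: blend one expert rating with the mean
    of repeated LLM ratings (all in [0, 1]). The 0.6 expert weight is
    an illustrative assumption."""
    llm_mean = sum(llm_scores) / len(llm_scores)
    return w_expert * expert_score + (1 - w_expert) * llm_mean

# An interpretation rated 0.9 by the expert but inconsistently by the LLM
print(trust_score(0.9, [0.7, 0.4, 0.6]))  # ~0.767
```

The design choice is that inconsistent LLM outputs pull the trust score down even when the expert is confident, flagging interpretations that need review.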
- Europe > Germany > Bavaria > Upper Bavaria > Munich (0.77)
- Oceania > Australia > Australian Capital Territory > Canberra (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- (3 more...)
- Research Report > New Finding (1.00)
- Instructional Material (1.00)
- Information Technology > Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area > Neurology (0.68)
- Education > Educational Technology (0.68)
- Education > Educational Setting > Online (0.67)
- Information Technology > Human Computer Interaction > Interfaces (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Mind Your Vision: Multimodal Estimation of Refractive Disorders Using Electrooculography and Eye Tracking
Wei, Xin, Liu, Huakun, Hirao, Yutaro, Perusquia-Hernandez, Monica, Masai, Katsutoshi, Uchiyama, Hideaki, Kiyokawa, Kiyoshi
Refractive errors are among the most common visual impairments globally, yet their diagnosis often relies on active user participation and clinical oversight. This study explores a passive method for estimating refractive power using two eye movement recording techniques: electrooculography (EOG) and video-based eye tracking. Using a publicly available dataset recorded under varying diopter conditions, we trained Long Short-Term Memory (LSTM) models to classify refractive power from unimodal (EOG or eye tracking) and multimodal configurations. We assess performance in both subject-dependent and subject-independent settings to evaluate model personalization and generalizability across individuals. Results show that the multimodal model consistently outperforms unimodal models, achieving the highest average accuracy in both settings: 96.207% in the subject-dependent scenario and 8.882% in the subject-independent scenario. However, generalization remains limited, with classification accuracy only marginally above chance in the subject-independent evaluations. Statistical comparisons in the subject-dependent setting confirmed that the multimodal model significantly outperformed the EOG and eye-tracking models; no statistically significant differences were found in the subject-independent setting. Our findings demonstrate both the potential and current limitations of eye-movement-based refractive error estimation, contributing to the development of continuous, non-invasive screening methods using EOG signals and eye-tracking data.
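A hedged sketch of the multimodal LSTM setup (channel counts, hidden size, and the number of diopter classes are assumptions; the unimodal baselines would simply drop one input stream):

```python
import torch
import torch.nn as nn

class RefractionLSTM(nn.Module):
    """Sketch of the multimodal configuration: EOG and eye-tracking
    channels are concatenated per time step and fed to an LSTM whose
    final hidden state is classified into diopter classes."""
    def __init__(self, eog_ch=2, gaze_ch=2, hidden=64, n_classes=12):
        super().__init__()
        self.lstm = nn.LSTM(eog_ch + gaze_ch, hidden, batch_first=True)
        self.fc = nn.Linear(hidden, n_classes)

    def forward(self, eog, gaze):           # (B, T, eog_ch), (B, T, gaze_ch)
        x = torch.cat([eog, gaze], dim=-1)  # per-step feature fusion
        _, (h, _) = self.lstm(x)
        return self.fc(h[-1])               # (B, n_classes)

logits = RefractionLSTM()(torch.randn(4, 500, 2), torch.randn(4, 500, 2))
```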
- North America > United States > New York > New York County > New York City (0.05)
- Asia > Japan > Kyūshū & Okinawa > Kyūshū > Fukuoka Prefecture > Fukuoka (0.04)
- Europe > Netherlands > North Holland > Amsterdam (0.04)
- (2 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study > Negative Result (0.34)
To MT or not to MT: An eye-tracking study on the reception by Dutch readers of different translation and creativity levels
Gerrits, Kyo, Guerberof-Arenas, Ana
This article presents the results of a pilot study involving the reception of a fictional short story translated from English into Dutch under four conditions: machine translation (MT), post-editing (PE), human translation (HT) and original source text (ST). The aim is to understand how creativity and errors in different translation modalities affect readers, specifically regarding cognitive load. Eight participants filled in a questionnaire, read a story using an eye-tracker, and conducted a retrospective think-aloud (RTA) interview. The results show that units of creative potential (UCP) increase cognitive load and that this effect is highest for HT and lowest for MT; no effect of error was observed. Triangulating the data with RTAs leads us to hypothesize that the higher cognitive load in UCPs is linked to increases in reader enjoyment and immersion. The effect of translation creativity on cognitive load in different translation modalities at word-level is novel and opens up new avenues for further research. All the code and data are available at https://github.com/INCREC/Pilot_to_MT_or_not_to_MT
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
- Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
- Europe > United Kingdom > England > South Yorkshire > Sheffield (0.04)
- (11 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Education (0.93)
- Health & Medicine > Therapeutic Area (0.67)
Classifying Subjective Time Perception in a Multi-robot Control Scenario Using Eye-tracking Information
Aust, Till, Kaduk, Julian, Hamann, Heiko
As automation and mobile robotics reshape work environments, rising expectations for productivity increase cognitive demands on human operators, leading to potential stress and cognitive overload. Accurately assessing an operator's mental state is critical for maintaining performance and well-being. We use subjective time perception, which can be altered by stress and cognitive load, as a sensitive, low-latency indicator of well-being and cognitive strain. Distortions in time perception can affect decision-making, reaction times, and overall task effectiveness, making it a valuable metric for adaptive human-swarm interaction systems. We study how human physiological signals can be used to estimate a person's subjective time perception, using a human-swarm interaction scenario as an example: a human operator guides and controls a swarm of small mobile robots. We obtain eye-tracking data that is classified for subjective time perception based on questionnaire data. Our results show that we can successfully estimate a person's time perception from eye-tracking data, and that the approach profits from individual-based pretraining using only 30 seconds of data. In future work, we aim for robots that respond to human operator needs by automatically classifying physiological data in a closed control loop.
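A minimal sketch of what individual-based pretraining could look like (the fine-tuning loop and hyperparameters are assumptions; `model`, `calib_x`, and `calib_y` stand for a population classifier and roughly 30 seconds of one operator's labelled eye-tracking windows):

```python
import torch
import torch.nn as nn

def personalize(model, calib_x, calib_y, steps=20, lr=1e-3):
    """Hypothetical individual-based pretraining: briefly fine-tune a
    population model on a short calibration segment from one operator
    before deployment."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    loss_fn = nn.CrossEntropyLoss()
    for _ in range(steps):
        opt.zero_grad()
        loss = loss_fn(model(calib_x), calib_y)  # calibration batch only
        loss.backward()
        opt.step()
    return model
```

A brief calibration pass like this is what would let the classifier adapt to an individual's baseline gaze behaviour inside the closed control loop the authors envision.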
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.86)
Modelling Emotions in Face-to-Face Setting: The Interplay of Eye-Tracking, Personality, and Temporal Dynamics
Seikavandi, Meisam Jamshidi, Fimland, Jostein, Barrett, Maria, Burelli, Paolo
Accurate emotion recognition is pivotal for nuanced and engaging human-computer interactions, yet remains difficult to achieve, especially in dynamic, conversation-like settings. In this study, we showcase how integrating eye-tracking data, temporal dynamics, and personality traits can substantially enhance the detection of both perceived and felt emotions. Seventy-three participants viewed short, speech-containing videos from the CREMA-D dataset, while being recorded for eye-tracking signals (pupil size, fixation patterns), Big Five personality assessments, and self-reported emotional states. Our neural network models combined these diverse inputs, including stimulus emotion labels for contextual cues, and yielded marked performance gains compared to the state-of-the-art. Specifically, perceived valence predictions reached a macro F1-score of 0.76, and models incorporating personality traits and stimulus information demonstrated significant improvements in felt emotion accuracy. These results highlight the benefit of unifying physiological, individual and contextual factors to address the subjectivity and complexity of emotional expression. Beyond validating the role of user-specific data in capturing subtle internal states, our findings inform the design of future affective computing and human-agent systems, paving the way for more adaptive and cross-individual emotional intelligence in real-world interactions.
- Europe > Denmark > Capital Region > Copenhagen (0.04)
- South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.94)